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In  surveying  the  literature  on  the  errors- in- the-variables  (EV) 
model,  we  are  struck  by  the  omission  of  two  important  aspects  of  the 
model.  It  is  a  well  known  fact  that  measurement  error  in  one 
explanatory  variable  yields  an  asymptotically  downward  biased  OLS 
estimator  of  the  coefficient.  It  is  equally  well  known  that 
measurement  error  in  the  dependent  variable  leads  to  a  reduction  in 
the  power  of  the  usual  t-statistic  in  testing  the  null  hypothesis  of 
no  relationship  between  the  variables,  even  though  the  point  esti¬ 
mates  yielded  are  unbiased.  However,  the  effects  on  the  power  of 
the  t-test  when  the  explanatory  variables  in  an  OLS  regression 
contain  measurement  errors  are  not  generally  known.  Moreover,  the 
case  In  which  there  is  measurement  error  in  more  than  one  explanatory 
variable  has  not  been  adequately  treated. 

In  this  paper  we  are  concerned  with  the  effects  of  measurement 
errors  in  one  or  more  explanatory  variables  upon  the  asymptotic  bias 

of  the  coefficients  and  the  t-tests.  We  first  look  at  a  simple 

♦  .  i _ _  _ _ _ 

regression  model  with  one  explanatory  variable.  We  do  this  because 
the  reader  will  find  it  somewhat  easier  to  follow  the  logic  in  the 
case  of  a  single  explanatory  variable.  Next  we  consider  a  mod-sl  with 
two  explanatory  variables.  Finally,  we  are  able  to  expand  to  the 
n-variable  model  by  way.  of  partitioning. 

*Any  views  expressed  in  this  paper  are  those  of  the  author. 

They  should  not  be  interpreted  as  reflecting  the  views  of  The  Rand 
Corporation  or  the  official  opinion  or  policy  of  any  of  its  govern¬ 
mental  or  private  research  sponsors.  Papers  are  reproduced  by  The 
RAND  Corporation  as  a  courtesy  to  members  of  its  staff. 
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In  summary,  we  find  that  if  one  variable  is  subject  to  measure¬ 
ment  error,  its  coefficient  and  t-statistic  are  asymptotically 
biased  toward  zero,  even  though  the  sign  of  the  asymptotic  bias  of 
its  standard  error  cannot  be  determined.  If  more  than  one  explana¬ 
tory  variable  is  subject  to  measurement  error,  the  sign  of  the  bias 
in  all  coefficients  and  t-statistics  cannot  be  determined  a  priori. 


II.  THE  SIMPLE  MODEL 


Consider  the  following  simple  regression  model: 


(2.1) 


yt  -  6  xi  +  ei 


where  y^  =»  vector  of  observations  on  the  dependent  variable, 

-  vector  of  observations  on  the  explanatory  variable, 
=  vector  of  disturbance  terms,  and 
t.  '  NID  (0,  a£2)  . 


However,  suppose  that  the  x1's  are  subject  to  measurement  error  such 
that  the  measured  independent  variable  x^  is  generated  by 

(2.2)  Xj  -  xi  +  ut 

where  u^  ~  NID  (0,  c^2)  —  and  E(x^  u^)  =  0  for  all  i,  j. 


Therefore,  the  following  model  is  estimated: 


iiy  assuming  that  measurement  errors  are  distributed  normally, 
we  can  snow  that  although  the  power  of  the  t  test  is  reduced,  the 
t  test  is  still  an  appropriate  rest.  If  the  u^'s  are  not  distributed 
normally,  then  we  cannot  even  say  that  the  t  test  is  appropriate. 
However,  the  remainder  of  our  results  (i.e.,  except  for  the  appli¬ 
cability  of  the  t  test)  still  hold  if  we  simply  assume  that  E(ui)  *  0 
and  i:(u,  u  )  =  0  for  1  #  j  and  »  for  i  =  j. 
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(2.3)  yi  *  b  xi  +  vi  * 


where  -  b  . 

It  is  a  well  known  result  that  the  least  squares  estimator  £  is 
an  asymptotically  biased  (toward  zero)  estimator  of  0.  This  is 
shown  below: 


b  » 


I  x 


i  >i 


1/ 


m  £(x+u) (Px+e) 
I(x+u)2 


P  £  x  +  £xe  +  PExu+Eu£ 

2  2 
lx  +  2£xu  +  £u 


Then,  if 


plim  4  I  x  e)  »  plim  (4  £  u  e)  *  plim  (—  £  x  u)  »  0 


2/ 


T-k» 


plim  (b) 


T-**>  T-x» 

plim  (^  P  £  x2) 

-  -  - 

plim  (^  I  x  +  ±  E  u  ) 


8 


2  2 
o  +o 
X  u 


(2.4) 


e  <TT7>  * 


here  plim  (=  £  x  )  ■  o  and  plim  (=r  £  u  )  *  o  m  X  a  i  X  >  0. 


Therefore,  the  estimator  b  is  asymptotically  biased  toward  zero.  In 
fact,  as  A  -*■  ®  ,  1/(1  +  A)  0  and  b  -*•  0. 

^For  notational  simplicity,  the  "i”  subscripts  are  hereafter  deleted. 

2 

Again,  for  notational  simplicity,  "plim"  will  be  written  "plim." 

T-x® 
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Next,  consider  the  estimator  of  residual  variance  in  the  EV  model 
(2.2)-(2.3) : 


(2.5)  cy2  -  |  I  (y  -  b  x)2 


«  |  E  y2  -  |  2  b  £  (x  +  u)  (6  x  +  e)  +  y  b2  E(x  +  u)2. 
Then,  through  manipulation  of  (2.5)  the  probability  limit  of  oy  may 


be  given 


, .  -  2  2  „2  ,  1  ,  2 

plim  ay  -  ay  -  fi  (y^Tj)  «x 


2  2  2  2  *  2 

Since  o£  “  °y  “  3  ax  »  the  estimator  oy  is  an  asymptotically  biased 

2 

(upwards)  estimator  of  .  The  asymptotic  bias  is  given 
plim  (o„2  -  oe2)  -  (yy-y)  S2  ax2 

Third,  consider  the  EV  model  estimator  of  the  variance  of  the  b 

coefficient.  The  EV  model  estimator  is  given  by 
*  2 

-  2  av 

ab 

X 


and  its  probability  limit  by 

-  2 

„  2  plim  oy 


(2.6) 


plim  a,  » 

D 


plim  o_ 


2  .  ,  A  „  ,2  2 

°e  +  (rrr)  8  ax 

(1  +  X)  o  2 


while  o 


8  o  2 

X 


Since  both  the  numerator  and  denominator  in  (2.6)  are  biased  upward, 
the  direction  of  bias  in  cannot,  in  general,  be  determined,  for: 
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(2.7)  pii»  <;b2  -  og2)  -  (xfx  - 

a 

x 

2 

2  a 
>0  if  B  >  (X  +  1) 

o 

X 


<0  if  B  <  (X  +  1) 

a 

x 

2  2  2 

Thus,  without  prior  knowledge  on  the  relation  between  B  and  oc  /o^  , 

-  2 

the  sign  of  the  asymptotic  bias  of  cannot  be  determined. 

Therefore,  one  might  conclude  that  since  the  "t"  ratio  is  the 
ratio  of  the  estimated  coefficient  to  its  standard  error,  the  direction 
of  bias  for  the  t  ratio  cannot  be  determined  a  priori  in  the  EV  model. 
However,  as  shown  below,  this  is  not  the  case.  For,  the  EV  model  the  t 
ratio  is  given  by: 


(2.8a) 


5o 


or 

.  b  a— 

(2.8b)  t.  - 

d  a 


and  plim  (t,  ) 


8  Cl-h-X1  *»1/2 

,  2  ,  c2  ,  A  .  2x1/2 

(oc  +  6  (r+r)  °x > 


(2.9) 


^  x-1/2 

B(1  +  A)  a  Bo 


where 


"-^TTa'A 

z 
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"nhat  is,  the  estimator  t^  is  asymptotically  biased  downward,  for  the 
numerator  on  the  left  hand  side  (l.h.s.)  of  (2.9)  is  biased  downward, 
while  the  denominator  on  the  l.h.s.  of  (2.9)  is  biased  upward.  Note 
that  since  u  in  normally  distributed  (see  the  footnote  on  p.  2), 

the  t  test  is  still  an  appropriate  test  of  H  :8  =  0,  because  under 

'  2  2  2  „  2  0 
Hq  b  is  NID  (0,  +  b  °u  )  and  is  chi-square  with  n-2  degrees 

of  freedom. 

III.  THE  TWO  EXPLANATORY  VARIABLE  MODEL 


Consider  the  following  two  explanatory  variable  regression  model 
(3.1)  y±  =  Bj.  xu  +  B2  x2i  +  e±  , 

where  x^  and  x2i  are  the  two  vectors  of  observation  on  the  two 
explanatory  variables.  However,  suppose  that  x^  is  subject  to 
measurement  error,  such  that  the  measured  explanatory  variable  is 
generated  by 


(3.2) 


vli 


Xli  +  Ui 


2 

where  NID  (0,  )  and  u^)  »  0  for  all  i,  j,  k.  Therefore 

the  following  model  is  estimated: 


(3.3)  y4  -  bx  xu  +  b2  x21  +  vx 
where  v^  «•  u^ 

Like  the  single  variable  case,  it  is  a  well  known  result  that, 
if  ard  x2  are  orthogonal  to  one  another,  the  estimator  b^  is  an 
asymptotically  biased  estimator  (toward  zero)  of  Sj_  and  the  estimator 
b2  is  a  consistent  estimator  of  32>  However,  results  when  x^  and  x2 
are  not  orthogonal  are  not  so  well  known. 
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To  consider  the  two  variable  model,  we  suppose  that  the  following 
probability  limits  exist  and  are  defined  as: 

plim  ^  Z  xx2  -  an 
1  „  2 

plim  f  r  u  -  auu  -  X  ou 

plim  y  1  “  au  +  auu  "  <1+x>  an 

plim  j  1  x2  “  c22 


(3.4) 


plim  j  E  Xj  x2  ■  o12 


1  _  2 

plim  y  E  y  -  a 

..  1  _  2 
plim  -  E  E  -  o 


yy 


■  a 

ee  e 


plim  •=  I  x.  u  *  0 


plim  i  £  *2  u  "  C 
plim  —  E  e  u  -  0. 


Then,  the  estimator  £ 


<y 


b  -  (X*  x)-1  X’  y 

/v<  vl 


is  defined: 


(X'  X)~A  X'  X  6  +  (X'  X)”1  X'  e 


where  X  -  (x.^,  x2i^  an<i  X  *  ^li’  x2i^ 
The  probability  limit  of  £  is  given 


plim  b  -  plim  [  (X'  X)-i  X  X  6]  +  plim  I(X'  X)  X'  e) 
-  plim  I (T-1  X’  X)"1  (T-1  X  X)  0) 


.  I  .0 

X  X  XX 
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where 

e__./ci+a)  «u 

X  X  1 

”i2).»d  s^-f11  °12\ 

\°12 

°22'  '°12  "ill 

Therefore, 


(3.5) 


plto  i  -  /*”“  'u  '»\  ( 

\°12  °22  j  \ 


all  °12 
a12  a22 


all  a22  ~  °12 


<WX)  °11  a22  -  °12 


X  an  o12 


(i+X)  on  a22  -  o12 


■(; :)(;) 
i 

\  62  +  Y  61  /, 


where 


all°22  "  °12 


<1+>  )  aU  °22  “  °12 


,  0  <  a  <  1  , 


and 


X  an  o12 


(1+X)  an  o22  -  o12 


2  • 


Therefore,  the  estimator  t.  is  an  asymptotically  biased  estimator 
(toward  zero)  of  .  However,  although  b2  is  an  asymptotically  biased 
estimator  of  b2»  the  sign  of  the  bias  cannot  be  determined  a  priori: 
b2  is  biased  upward  if  g^  and  have  the  same  sign  and  downward  if 
and  o^2  ere  opposite  in  sign.  Note  that  if  and  x2  are  orthogonal 
(i.e.,  0^2  “  0)  is  still  biased  toward  zero  but  62  is  an  unbiased 
estimator  of  S2  since  y  *  0. 
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.  2 

Hext,  consider  the  estimator  of  residual  variance,  0y  ,  in  the 
two  variable  EV  model: 

av2  -  (y  -  x  b) '  (y  -  X  b) 

-  y*  y  -  2  b*  X’  (X  B  +  e)  +  b’  X’  X  b  . 

*  2 

The  probability  limit,  as  T  +  »,  of  oy  is  given  by: 

(3.6)  plim  ov2  m  ayy  ~  2  b*'SXX  6  +  b*’(ZSC)  b*’ 
where  b*  *  plim  b.  Equation  (3.6)  may  be  rewritten 


=■  o^  +  [(«*)  a2  -  2a]  Bj2  an  -  2  Bj^  B2  o12  +  2y(a-l)  ou 


+  y 


2 


a22 


Since  o£2  -  a  -  au  -  2  ^  Bj  -  B.,2  a2J,,  the  asymptotic 
bias  of  oy2  is  given  by: 

plim  (oy2  -  oe2)  -  I(a-l)2  +  1  a2]  B^  ou  +  2y(a-l)  B*  0^  +  y2  6^  a 22 


1This  may  be  proved  by  substituting  in  for  y  and  a. 
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-  2  2 
That  Is,  o  Is  an  asymptotically  biased  (upward)  estimator  of  . 

The  estimator  of  the  variance  of  the  coefficients  in  the  two 
variable  EV  model  Is  given  by  the  diagonal  elements  of  the  matrix 


-  * }  (X'  X)"1 


The  probability  limit  of  may  be  written 


(3.7) 


while 


-  2 

plim  a,  «=  a 

D 


2  2 
°8  “°c 


,  .2  /V 5  \ 

v  y  (l+i)  o^/vj , 

)■ 


°22/D 

all/D 


where  D  -  det  (1^)  and  D  «=  det  (Ejjjj) 


Just  as  with  the  one  variable  EV  model,  the  sign  of  the  asymptotic 
bias  of  the  estimator  of  the  variance  of  the  coefficients  in  the  two 
variable  EV  model  cannot  be  determined  a  priori,  for  both  the  numera¬ 
tor  and  the  denominator  for  each  of  the  elements  in  (3.7)  is  biased 
upward . 

However,  in  spite  of  the  fact  that  the  sign  of  the  bias  of  the 
standard  errors  of  the  coefficients  cannot  be  determined,  we  are  able 
to  determine  the  bias  in  one  of  the  t-statistics.  Whereas  the  true 
t-8tatistics  are  given 


and 


‘  6i 


C  62 


bx  ex  rs 

°8l  ‘  ae  •  °2 
S2  02  ^ 


where  o 


i 


The  t-statistics  in  the  EV  model  are 
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and 


Then, 


b2  V 


plim  (t.  ) 
1 


a  6, 


a/^  i 


a  * 

v 


/022  av*  °1 


Since  a  «—  ,  plim  (t.  )  may  be  rewritten 
D  1 


-4-  77 


If  B 


(3.8)  plim  (t,  ) 
bl 


rs 

1  /I 


•*/~D  B, 


a  * 

v 


V  °2 


Therefore, 


(3.9)  | plim  (tfe  )  j  <_  |  tg 


/— r- 

since) -  I  <  1  and  a  *  >  a  . 

i  ^r__  —  v  —  e 


That  is,  the  t-statistics  for  in  the  two  variable  EV  model  is 
asymptotically  biased  toward  zero. 


On  the  other  hand. 


(3.10)  plim  (t.  )  - 


B2  +  y  Si 


2  o  *  V(l+1)  on/D 


which  suggests  that  the  sign  of  the  bias  of  t^  is  not  known  a  priori, 
for  &2  may  be  biased  upward  (if  gj  and  a12  are  the  same  in  sign)  or 
biased  downward  (if  gi  and  cr^  are  opposite  in  sign). 

Thus,  (3.9)  and  (3.10)  indicate  that  although  the  bias  in  the 
t-test  of  the  coefficient  of  the  variable  which  does  not  contain 
measurement  error  is  not  known,  the  t-test  of  the  coefficient  of  the 


i 
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variable  which  does  contain  measurement  error  is  biased  toward  zero. 

Again,  as  in  the  one  variable  EV  model,  the  t-test  is  an  appropriate 

test  of  the  null  hypothesis:  H  :  6.  *  0  because  under  H  ,  b.  is 
2  2  o  l  o  J- 

NID(0,  )  and  o is  chi-square  with  n-3  degrees  of  freedom. 

IV. _ THE  n- VAR TABLE  EV  MODEL 


Consider  the  following  multiple  regression  model 


(' *D  yi  =  81  Xli  +  82  X2i  +  •**  +  8n  Xni  +  ei  ’ 

where  the  x^'s  are  the  explanatory  variables.  However,  suppose  that 
is  subject  ;o  measurement  error,  such  that  the  measured  explana¬ 
tory  variable  is  generated  by 


(4.2)  xu  -  xu  +  ut  , 


where  u^^  *  NID  (0,  au  )  .and  E(x^  u^)  “  0  for  all  i»  j>  k. 


Therefore,  the ’ following  model  is  estimated 


(4.3) 


bl  xli  +  b2  x2i  +  b3  x3i  +  ”•  +  bn  xni 


where  v^  »  -  b^  u^. 

Again,  as  in  the  one  variable  model,  it  is  a  well  known  result  that 
b^  is  an  asymptotically  biased  estimator  of  8^  and  that  b.,  i  *  2,  n, 
are  unbiased  estimators  of  g  ,  i  ■  2,  n,  if  x^  is  orthogonal  to 
every  other  x^.  However,  to  our  knowledge,  results  have  not  been 
established  in  the  use  in  which  x^  is  not  orthogonal  to  the  remaining 
variables. 

This  n-variable  model  may  be  analyzed  by  partitioning  the  X 
matrix  into  two  parts:  and  every  other  x.  Thus,  X  =  (x^,  Xj) 

Xj  is  a  t  x(n-l)  matrix  of  t  observations  on  the  remaining  n-1  variables. 
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How  treating  as  in  the  two-variable  model,  we  can  obtain 

the  same  results  in  the  n-variable  model  as  in  the  two  variable  model: 

namely,  (1)  b.  is  biased  toward  zero,  (2)  the  sign  of  the  biases  in 

A  A  “  A 

b„,...  b  is  indeterminant,  (3)  the  t-ratio  for  b,  is  biased  toward 
i  n  -L 

zero,  and  (4)  the  direction  of  the  biases  in  the  t-ratlos  for  b2>...  b 
is  indeterminant. 

Notice,  however,  that  the  n-variable  model  considered  here  is 
not  the  most  general  case,  for  we  have  considered  only  one  variable 
with  measurement  error.  Zn  fact,  the  results  derived  here  will  not 
in  general  hold  when  more  than  one  of  the  variables  contain  measure¬ 
ment  error.  This  can  be  shown  by  returning  to  the  two  variable 
model.  Consider  the  two  variable  model  in  which  both  explanatory 
variables  contain  measurement  error: 


(4.4) 


xli  “  xli  +  uli 


X2i  "  x2i  +  “2i 


where  u^  '  NID  (0,  X^  Ojj). 


Then, 


plim  (b) 


hi  °2\/6l  \/°l  Bl  +  Y2  *l\ 

\Y1  °2/\®2  /  \a2  82  +  Y1  ei/ 


where  “ 


(?22  "  ^2 

2 

(1+A^)  (^^2^  ^11  °22  -  °12 


2 

(1^2^  ^11  °22  _  ai2 
a2  "  "  ~  2  • 
^11  a22  -  ai2 


X1  °11  712 

Y1 - 


2  ’ 

12 


(l+X1)(l-iX2)  on  o22  -  a 
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*2  °22  °12 

and  Y,  -  - 5- 

(1+A1)(1+A2)  a11  ct22  -  a12 

Therefore,  unlike  the  case  of  measurement  error  in  only  one  explana¬ 
tory  variable,  the  direction  of  bias  in  the  coefficients  can  no  longer 
be  determined  a  priori.  As  a  result,  the  previous  conclusions 


regarding  the  t-statistic  are  also  no  longer  valid. 


